Faking it: Synthetic Text-to-speech Synthesis for Under-resourced Languages - Experimental Design
نویسنده
چکیده
Speech synthesis or text-to-speech (TTS) systems are currently available for a number of the world’s major languages, but for thousands of the world’s ‘minor’ languages no such technology is available. While awaiting the development of such technology, we would like to try the stop-gap solution of using an existing TTS system for a major language (the base language) to ‘fake’ TTS for a minor language (the target language). This paper describes the design for an experiment which involves finding a suitable base language for the Australian Aboriginal language Pitjantjajara as a target language, and evaluating its usability in the real-life situation of providing language technology support for speakers of the target language whose understanding of the local majority language is limited, for example in the scenario of going to the doctor.
منابع مشابه
Developing Speech Synthesis for Under-Resourced Languages by "Faking it": An Experiment with Somali
Speech synthesis or text-to-speech (TTS) systems are currently available for a number of the world’s major languages, but for thousands of other, unsupported, languages no such technology is available. While awaiting the development of such technology, we propose using an existing TTS system for a major language (the base language, BL) to “fake” TTS for an unsupported language (the target langu...
متن کاملAutomatic Speech Recognition for Under-Resourced Languages:
Speech processing for under-resourced languages is an active field of research, which has experienced significant progress during the past decade. We propose, in this paper, a survey that focuses on automatic speech recognition (ASR) for these languages. The definition of under-resourced languages and the challenges associated to them are first defined. The main part of the paper is a literatur...
متن کاملTowards automatic cross-lingual acoustic modelling applied to HMM-based speech synthesis for under-resourced languages
Nowadays Human Computer Interaction (HCI) can also be achieved with voice user interfaces (VUIs). To enable devices to communicate with humans by speech in the user’s own language, low-cost language portability is often discussed and analysed. One of the most time-consuming parts for the language-adaptation process of VUIcapable applications is the target-language speech-data acquisition. Such ...
متن کاملPhonetic alignment for speech synthesis in under-resourced languages
The rapid development of concatenative speech synthesis systems in resource scarce languages requires an efficient and accurate solution with regard to automated phonetic alignment. However, in this context corpora are often minimally designed due to a lack of resources and expertise necessary for large scale development. Under these circumstances many techniques toward accurate segmentation ar...
متن کاملSpeech data collection in an under-resourced language within a multilingual context
In this paper, we present an end-to-end solution to the development of an automatic speech recognition (ASR) system in typical under-resourced languages, where the target language is likely to be influenced by one more embedded foreign languages. We first describe the collection and processing of the text corpus crawled from the World Wide Web using the Rapid Language Adaptation Toolkit. In par...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005